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Abstract: Many statistics available to constrain non-Gaussianity from inflation are sim- 
plest to use under the assumption that the curvature correlation functions are hierarchical. 
That is, if the n-point function is proportional to the (n—l) power of the two-point function 
amplitude and the fluctuations are small, the probability distribution can be approximated 
by expanding around a Gaussian in moments. However, single-field inflation with higher 
derivative interactions has a second small number, the sound speed, that appears in the 
problem when non-Gaussianity is significant and changes the scaling of correlation func- 
tions. Here we examine the structure of correlation functions in the most general single 
scalar field action with higher derivatives, formalizing the conditions under which the fluc- 
tuations can be expanded around a Gaussian distribution. We comment about the special 
case of the Dirac-Born-Infeld action. 
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1. Introduction 

The next decade will bring extraordinarily detailed measurements of the cosmic microwave 
background from the Planck satellite |Q. In addition, large volume surveys will catalogue 
cosmological structures (such as galaxies, galaxy clusters, and voids) whose number and 
distribution depends on the same initial conditions that explain the temperature fluctua- 
tions^. This new data will enable an increasingly precise determination of the temperature, 
polarization and matter power spectra over a widening range of scales. More importantly, 
it will also allow measurements of higher order statistics. These statistics are an excellent 
probe of inflation, in theory measuring interactions of the inflaton that can distinguish 
qualitatively different inflationary physics and perhaps truly testing inflation itself. 

^Relevant current and upcoming surveys include SDSS, BOSS, WFMOS, ADEPT, EUCLID, DES, 
PanSTARRS, LSST. 
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If the connected part of any primordial higher order correlation function is observed to 
be different from zero, then in the inflationary paradigm the inflaton was not a free field and 
its fluctuations (which source the observed temperature and matter density contrasts) were 
non-Gaussian. Clearly, at least a small deviation from Gaussianity is expected. Further- 
more, models with derivative self-interactions or multiple fields (including the alternative 
ekpyrotic scenario) can generate significant non-Gaussianity [Q, ^, ^, ^, ^, ^, ^. This is 
in contrast to standard single-field slow-roll with a smooth potential, where the level of 



non-Gaussianity is proportional to slow-roll parameters y, |10[. Such non-Gaussianity is 



so small that its detection must likely wait, even in the most optimistic analysis, at least 
until measurements of the 21cm line |11] and even so must be separated from gravity in- 



duced effects that are an order of magnitude or two larger |12, 14, Here we will 
focus on the single-field scenario where the primordial non-Gaussianity is mostly generated 
by derivative interactions and is observably large. This has the dual advantage of being 
observationally relevant now and calculationally simple. 

Much of the work on the observational implications of non-Gaussianity has so far 
focused on the CMB bispectrum (3-point function) for the excellent reason that this is 
likely to be a statistic where a deviation from Gaussianity is easy to detect. More generally, 
tools for evaluating non-Gaussianity come in two types: explicit comparison of correlation 
functions predicted by a model to the data (so far, the 3-point or bispectrum as described 
above) and interpretation of other measured statistics (such as cluster number counts 
0, 0, m, m, H, m, H, H or Minkowski functionals ||, ||). The first case is of course 
ideal if we are confident that we know the correct model. However, non-Gaussianity is an 
excellent tool for distinguishing between different inflationary physics, so it is worthwhile to 
check for non-Gaussian features of the data in a variety of ways. Statistics like Minkowski 
functionals capture information about more than one correlation function and must to be 
interpreted to give, for example, constraints on the magnitude of the 3-point function (e.g., 
/atl for the local ansatz). 

At the moment, the translation between measured statistics and inflationary theory 
depends largely on the hierarchical assumption that the n-point correlation scales like the 
two-point to the re — 1 power |26]. This assumption means in practice that one can expand 
around the Gaussian result in moments of the true distribution, as long as the amplitude of 
fluctuations is also small. It is a property of the non-Gaussianity generated by gravity after 
horizon re-entry (see e.g. [27| for a thorough discussion) and is implicit in the local ansatz 
where the parameter /^r^ originates. However, the conditions under which a fundamental 
physics model gives rise to hierarchical correlations are only beginning to be discussed. 
The three and four point correlations from terms in the potential were discussed in |28|, 
and there was earlier interest in models with dimensional scaling, where the re-point is 
proportional to the 2-point to the re/2 power (||2^ and references therein). When there are 
small parameters other than the root-mean-square amplitude in the problem, the scaling of 
the re-point with the two-point no longer ensures that a Gaussian expansion can be made. 
Here we consider 'hierarchical' to mean that the distribution of curvature fluctuations can 
be expanded in a series of moments about a Gaussian (in a way that is made explicit in 
Section l2|). As our ability to observe higher order statistics improves, it is important to be 
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clear about the structure expected from various physical processes to avoid misinterpreting 
the data. 

In this paper, we will examine the conditions under which the hierarchical assumption 
is justified for single-field models, where a second small number, the sound speed, appears in 
the correlation functions. Intuitively, the hierarchical structure in a single field model looks 
like a consequence of perturbativity, and we will show that there is indeed a single constraint 



that ensures this structure and the validity of perturbative calculations. Following p9l 30 



31, 32, 33 1, we will take an effective field theory (EFT) approach. More specifically, we 
will consider the EFT for the scalar field sourcing inflation, rather than the theory of 
the fluctuations. We study the most general single-field action and extend the results of 



31]. We do this with the goal of understanding the structure of correlation functions 
from derivative interactions, and hope that some useful features can be uncovered even if 
a complete 'EFT for inflation' is not fully understood. Field theory for inflation is being 



treated with much more care in some recent work [34, 35, 31, 36, 37| and it will be very 



interesting to verify that there is no subtlety that might change our conclusions. 

Specifically, we will consider a single field description of the canonical inflaton (p, valid 
below some scale M < Mp (the reduced Planck mass). We take the point of view that this 
field exists regardless of whether there is also some physics providing a potential suitable 
for inflation. That is, we use flat space effective field theory intuition to construct the 
derivative terms in the effective action, then consider supplementing it with an appropriate 
potential energy 1^(0) = 3MpH^ (where H is the Hubble parameter) and computing the 
resulting spectrum of scalar fluctuations. Since we will only be interested in examining the 
correlation functions in a regime where the dominant non-Gaussian contribution comes 
from kinetic terms, we will not worry about the details of V{(f)). 

Discussions of single fleld inflation have so far largely focused on actions of the form 

S = I d'x^[P{X, <P) + ^M^R] (1.1) 

where X = —^g^^d^(l)du4> [il]- For inflationary purposes (where the background solution 
is spatially homogeneous) the scalar part of the action describes an ideal fluid. This can 
be seen by computing the stress-energy tensor or more elaborately from symmetry argu- 
ments We will review the cosmology associated with this action in the next section, 
including the hierarchical structure of correlations as long as a perturbative condition is 
satisfied. However, the P{X, (/)) action is not the most general effective action for a scalar 
field (j) since it contains no higher derivative terms. At the level of four-derivative terms, it 
was recently shown that this makes no difference |31] but cases with large non-Gaussianity 



require an understanding of terms with more derivatives. We will examine the most generic 
action up to six-derivative terms and show that the same perturbative condition still main- 
tains the hierarchical structure. Extending to more derivatives uncovers no new features in 
this regard. The Dirac-Born-Infeld (DBI) action is a particularly interesting example of a 
P{X, <j)) action that contains a square-root summation of powers of X. It is the action of a 
D-brane in the limit of small acceleration (small curvature), with higher derivative correc- 
tions whose structure is known in many contexts and can be guessed for the case relevant 
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here. Because of this special structure, we will return to this example after discussing the 
more generic case. 

Higher derivative corrections are of little concern for the background inflating solution, 
H{(I)q), which anyway requires 0o ^ H(/)q, but it is less clear that these terms can be 
ignored in the correlation functions when the extra derivatives act on the fluctuations 5(j). 
Examining six-derivative terms in a generic scalar field action will uncover the condition 
under which these terms do not contribute significantly to correlation functions. 

In the next section, we further motivate our interest in the structure of correlation 
functions and review some basic notation for general single field models and inflationary 
calculations. In Section |^ we extend the analysis of |31] to include the most relevant six 
derivative terms. Section ^ discusses the general structure of higher derivative terms in the 
perturbative regime and contains the main result. We turn to the illustrative example of 
the DBI action in Section ^ and contrast the likely form of the action with the general 
formalism derived in Finally, Section ^ summarizes and concludes. Appendix ^ 

demonstrates that calculation of the correlation functions for the action expanded to four 
or six derivatives gives the same scaling arrived at in Section |^ in a quick but approximate 
way. 



2. Review of inflation from P{X, 0) 

Here we review our notation for the power spectrum and cumulants, and the hierarchical 
ansatz. We also demonstrate the scaling behavior of correlation functions from derivative 
interactions in an action whose scalar part is P[X,(j)) (Eq.( |l.lD ). 

2.1 Inflationary Fluctuations 

We consider a single-field inflationary scenario where the inflaton field is <j) = (po + 6(j) (we 
will work largely in the spatially flat gauge). For the action in Eq.( [L.l[ ), the procedure for 
quantizing scalar fluctuations proceeds just as for a field with a standard kinetic term, up 
to the appearance of an additional parameter, the sound speed, defined by 

Px + 2XPxx ^ ' 

where P^x is the partial derivative of P with respect to X [38, 4^, 41, 42| . Scalar fluctuations 
on a scale k freeze out when aH = Csk, where H = a/a and a is the scale factor. Inflation 
is a period of accelerated expansion, which occurs as long as e < 1 for 

. = (2.2) 

Fluctuations have mean zero and amplitude 

m"" = (2-3) 

2ti^CsP,x 
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which reduces to the famiUar answer for a canonical kinetic term. Using the lowest order 
gauge transformation, ( = —{H/(pQ)6(j), we can go from the spatially flat gauge to the 
comoving gauge to find the amplitude of primordial curvature fluctuations 

H 



(c 



2\l/2 



More precisely, the two-point function is defined 



(2.4) 



(2.5) 



(2^)^5^(fci + A;2 



{- 



where is the dimensionless power spectrum (variance). 
2.2 Some notation for cumulants 

The power spectrum, or two-point function, is all that is needed to characterize a Gaussian 
field. Here we are interested in non-Gaussianity, which is a catch-all term indicating that 
additional information is needed to characterize the fluctuations. To structure the discus- 
sion, we review some notation for general probability distributions. We use the example 
of the Probability Distribution Function (PDF) for density fluctuations, 6, to review how 
an observable may depend on the structure of correlation functions. This procedure can 
be generalized to apply to other quantities and other statistics, including the Minkowski 
functional |^|. 

For a probability density function P{6)d6, the n-th central moment is 



in 



5'^P{5)d5 



(2.6) 



The n-th cumulant is the connected n-point function and cumulants with n > 3 are zero 
for a Gaussian distribution. The reduced (or normalized) cumulants are defined as 



Then an exact expression for the PDF in terms of the cumulants is given by 



P{5)d5 



d5_l_ 
2-111 (t| 



dyexp 



S{y) 



where is the variance and the generating function 5(y) is defined by 



S[y) 



(-1)"-^ ., 



n=2 



Changing integration variables to y = y/crs, this is 



P{5)d5 



d5 1 



dy exp 



E 

n=2 



(-1) 



n-1 



n! 



-y [OnCTs ) 



(2.7) 



(2i 



(2.9) 



(2.10) 
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Using the saddle point approximation to perform the integral in Eq. (|2.8| ) and collecting 
terms of the same order, one arrives at the Edgeworth expansion: 



1 + Ts-^H^iy) + cji ( f^ii^{y) + ^^eM ) + . . . 



(2.11) 



where v = 6/as and the Hn are Hermite polynomials. Both Eq.(2.10) and Eg. ( p. 11 ) are 



an expansion in what we will call the dimensionless cumulants, 5„cr" ^, so truncations of 
these expressions give a good approximation of the actual PDF if 

1 > |cS3|ct> |54|(j2... (2.12) 

where we have drop the subscript to indicate that this expression is general. If only the 
first few terms in the expansion are kept the recovered probability distribution will be 
approximate and have a limited range of validity. For example, the PDF can develop 
negative regions if only the first Ss term is kept. This discussion has been one-dimensional, 
but we are often concerned with spherically symmetric situations where the fiuctuations 
are smoothed on a scale R. Then all of the quantities above become functions of R, but 
otherwise the expressions are unchanged. Of course, Eq.( |2.1l| ) is also an expansion in z^, so 
the regime of validity of the Edgeworth expansion for large scale structure calculations also 
depends on the smoothing scale (which affects as) and the region of the PDF of interest 
(6). 

An observable like the number of large structures (galaxy clusters) depends on the 
probability of a several-sigma fluctuation, and so is sensitive to the area under the tail of 
the PDF. The potential to constrain a fundamental model of non-Gaussianity by counting 
structures relies on the validity of expanding in the dimensionless cumulants, or on some 
other more exact knowledge of the relevant distribution. 

2.3 Hierarchical structure and the local model 



The local model ansatz for primordial non-Gaussianity is [43| 



C(x) = Cg(x) + pNL [CS(x) - (C^(x))] (2.13) 

where C(x) is the primordial curvature perturbation, Cg(x) is a Gaussian random field 
{{Cq{x)) = (7^) and the amplitude of non-Gaussianity is parameterized by J'nl, which we 
take to be constant for simplicity. This generates correlation functions with hierarchical 
scaling. For the two-point function at a single point in real space to be nearly given by the 
two-point of the Gaussian piece requires 

^/tvl) a^^a^^^\fNLW=^\S3\a^l. (2.14) 

This is the condition that the model is not 'too' non-Gaussian. (Numerically, on CMB 
scales, {(q{x)) = o"^ 10^, so this gives I/nlI ^ 10^/^, which is much weaker than the 
current observational constraint.) Furthermore, higher order cumulants go like powers of 
InlO', so Sn oc (/atlO")"'"^. As long as IJnlW ^ the expansion in cumulants is valid. 
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Non-Gaussianity of the local shape is a feature of multi-field models, but in such 
scenarios Eq.( 2.13| ) is only the first term in a more general expansion 



C(x) = Cg(x) + IfNL [C^(x) - (cS(x))] + gNLCh + ■■■ (2.15) 

The range of possibilities for non-Gaussianity in multi-field models is just beginning to 
be studied systematically, but as suggested in |Q for a curvaton scenario, such models 
are not necessarily hierarchical. Even if each individual field is perturbative and generates 
hierarchical correlations, a fine tuning may allow the skewness of the observed curvature 
perturbation to be nearly zero while the kurtosis is measurably large. In addition, although 
53 is constrained by observation and ^4 is in principle, it has not yet been observationally 
established if \S3\a ^> |54|c7^. 

2.4 Fluctuations and effective field theory 

This section has so far taken a largely observational point of view, which we now want 
to connect to the scalar field theory during infiation. For inflation from a single scalar 
field, non-Gaussianity larger than that generated by the usual slow-roll must come from 
non-trivial kinetic terms (as long as the potential is smooth and nearly fiat). Using the 
parameter fiy^ as a proxy for magnitude of non-Gaussianity, with {/nlI > 5 probably 
observable in the near future, one might guess that {/nlI ~ 1 for general theories of the 



type Eq. (|l.l| ). The reasoning is the following |29|: Consider a Lagrangian P{X,(f)) with 
the infiaton and X = —^gf^'^d^(j)d,y(f). Suppose 

P{X, 0) = -V{^) +X + a,^+a2^ + ... (2.16) 

where the potential energy V{(j)) will drive inflation and M is some scale (like Mp or the 
string scale nis) suppressing higher order operators. If X/M'^ <C 1, this series can be 
truncated. Then if X/M'^ <C 1 and the coefficients are of order 1, Px ~ 1, "^XPxx ^ 1 
and ~ 0{a fewxlO"^). The sound speed was defined in Eq.(2.1) and gives the magnitude 



of the non-Gaussianity through {/nlI oc cj'^ [Q]. This is easy to see by examining the ratio 
of terms in the Lagrangian that are cubic in fluctuations to the free, quadratic part. 

Here we will be interested in the case <^ 1. To see how this regime can be under 
control, we should relate the scale M to other parameters in the theory, starting with the 
relation 

2M^H' 

4> = (2.17) 

where primes are derivatives with respect to (p and over-dots are time derivatives, and 
rewriting in terms of the parameter e 



H _ 2Ml fH' 



we flnd 



/2 

_p_ 

M4 PxM^ 



X eHHil 



(2.18) 
(2.19) 
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With Px ~ 1, this gives 

^ < 1 ^ eH^M^ < (2.20) 

As pointed out in |31], for M ~ y/eMp, the COBE normahzation imphes ^ ^ 4 x 10~^. In 
addition, the condition that the kinetic energy is small compared to the potential energy 
is X <C 3MpH^ ^ ]^ f (m) ' which is small unless e is unusually tiny. 

In contrast, we are interested in the case where is not too small so that the 
derivative interactions are significant. This is more natural when both the Hubble scale 
and the scale M are well below the Planck scale (see Eq. (|2.19D ). Notice that for inflation 
to continue (that is, for potential energy to remain dominant) in the regime where is 
close to one, we must have V{(j)) > X ~ M^. (Although we require that the inflaton mass, 
m, is less than M.) 

With these relations in mind, we now turn to a consideration of correlation functions 
in actions of the fluid type, Eq.(l.l). 



2.5 Structure of tree-level correlation functions for a perfect fluid 

To examine the structure of correlation functions for actions of the P{X, (p) type, consider 
Taylor expanding the action in Eq. ( |1 . 1[) around the background value of the field (f)^, 
ordering terms according to powers of the fiuctuation so that 

5 = So + ^2 + ^3 + . . . (2.21) 

where Sq is the background solution. Si vanishes when the equations of motion are satisfied, 
5*2 is quadratic in fluctuations and is used to write the mode equations for the fluctuations. 
The solution to those equations is used to evaluate the 3-point function coming from terms 
in ^3 and to give a magnitude of non-Gaussianity. We can then determine the interaction 
Hamiltonian i?/, starting with the terms from S^^ and start computing correlation functions 



as outlined in |34]. Correlation functions at time i, taken to be long after horizon exit but 



before the end of inflation, are computed using the "in- in" formalism [45, IC, 34 1. At tree 
level, 

{5<l,{ki,t)...5ct){kn,t))= (2.22) 
i I dt' ([Hi''\t'),64>{ki,t)...S(t>{kn,t)]] 

J —oo 

where Hj"^^ is the term with n powers of the fluctuation. As reviewed in Appendix 
performing this calculation for an action including an term shows that the dominant 
contribution to the three-point correlation of the primordial curvature (when Cg is small) 
has the form 

C(*.)C(fe)C(*s)) ^ (-3, 

X ( ^ J B{ki,k2,h) 
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where B{ki, k2, ^3) is dimensionless with terms of order 1 and terms suppressed by c^, and 
K is the sum of the momenta. 

Since we are just interested in how higher order correlation functions scale with the 
amplitude and the sound speed, we do not actually need to do the full calculation for 
more complicated actions. Instead, we can Taylor expand the action P{X, (p) 32]. The 
Lagrangian density is used below to simplify notation, 



(2.24) 



-P 



XX 



''^ -a-\V5<pf\ + \p^xxx^'' 

D 



+ ^P,xxxx(, 



2i2 



1 

A' 



+ . . . 

2 r ,-• , 2 



+ ^P.xxx^^H [6<P -a-^{V6(l)f] 



+ ... 



Two simplifying relations we use here and throughout the paper are 

d d aH 

dt ' dx Cs 

Then, using the rms value of fluctuations at horizon crossing, inserting powers of ( 
as needed and identifying the combination Vc_ — ^ 



+ 2XPyx 




+ ... 



gives: 



(2.25) 
= 2X 

(2.26) 



+ '^X^P^xxx 



+ 



+ ... 



+ AX-'Pxxx 1 



+ P,XXXX 



+ ... 



where two useful combinations of derivatives have been defined 



S = XPx + 2X'Pxx 



XP 



X 



(2.27) 



A = X'P^xx + ^X'P^xxx . 

As long as the coefficients of higher order terms are not too large (conditions like A < 
3S/2C3), this pattern continues and the dominant contributions at order n in the small 
sound speed limit go like 

n-2 



A„ I (Fx G?L2 



V, 



1/2' 

c 



(2.28) 



-9- 



where An is some constant coefficient. Exceptions to tliis beliavior require fine-tuning, and 
an explicit example was worked out in [57]. 



We can now estimate the integral in Eq.( |2.23 ) using (Ax)^ At ~ {cs/aH)^H^^ to see 
that At(Ax)^a^£2 ~ 1- This estimate assumes that the dominant physical contribution 
to the integral should be near the scale of horizon crossing. Then the n-point correlation 
scales like 

iC) oc (C')""' (c72)«-2 (2.29) 

This is the hierarchical form, but with extra powers of the sound speed. To check if those 
factors are problematic, we note that the dimensionless combination of the magnitude of 
the n-point function scaled by the two-point function, the dimensionless cumulant, goes 
hke: 

(C") /^2\(n-2)/2 



,1/2^ "-2 



So for Cg ^ and for An of order one, the truncated cumulant expansion will give a 
reasonable approximation to the PDF. 

3. Expanding the most general action with higher derivatives 

The observationally interesting case of large non-Gaussianity (small sound speed) requires 
that X/M* be close enough to one that many terms in the expansion are important. 
Although there are special examples like the DBI action where the entire structure of first- 
derivative terms is known, it is interesting to ask how small a sound speed is reasonable 
in the most general single field inflation. The procedure above indicates the perturbative 
approach is valid while cf > P^, but considered only single derivative terms. In pO| , 
the same conclusion was reached considering first derivative terms organized in different 
way. That work, an effective theory of fluctuations, considered higher derivative terms in 



operators depending on the extrinsic curvature and its derivatives. The approach of ||3^ is 
very useful for understanding a variety of cases, but in most inflationary models it is the 
effective theory of the inflaton itself that is uncovered term by term. With that in mind, 
we consider the first few steps of an expansion of the action in powers of derivatives. 

The generic expansion to fourth order in derivatives was recently carried out by Wein- 
berg ||3l| , who found that the only new term generically present in the action^ has the form 
fi{(f>)^- To arrive at this simple answer, the quadratic equation of motion was used to 
replace Dcp = V'{(f)), which ensures that there are no extra propagating degrees of freedom. 

3.1 Expansion at six derivatives 

From an effective field theory point of view, one non-renormalizable operator cannot be 
added to the theory in isolation. Continuing to six-derivative terms and using integration 



^At four derivatives, this is the only important term for scalar fluctuations even after terms involving 
metric derivatives are considered. We are not repeating the gravitational part of the analysis for six 
derivative terms, but we do expect some terms involving the metric to remain, suppressed by M /Mp. 
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by parts to remove redundancies, there are eight independent terms involving derivatives 
of the scalar field (ignoring new terms involving derivatives of the metric, which are likely 
to be suppressed by powers of Mp rather than M). They have the following structure: 

x3 , x^u<i, , x{U4>f , (□0)3 , 0n30 , x<7'^-(a^0)(a,n</.) , (3.1) 

Notice that the only term involving two fields may be written as powers of the box operator, 
and that there are only two terms that do not involve box. 

In the expansion up to four-derivative terms, using the lowest order equation of motion 
in the Lagrangian conveniently removed second derivatives of (p and recovered the simple 



structure of the 4> propagator |^. Since at any order in derivatives, the only term that is 
quadratic in (j) can be written as i;^n"(/>, eliminating box in this simple way ensures that the 
(j) propagator keeps a single physical pole, with corrections in powers of m/M. Substituting 
for the box operator in the same way in the six derivative terms adjusts the coefficients of 
lower order terms in the action. Although we will generate higher order derivative terms 
at each stage that will not be removed, these are still small perturbations and so we do not 
worry about having additional degrees of freedom. Then, up to terms with six derivatives, 
we may write 



X 



+8h{cP/M)^+F{R,4 

where the fi{(j)/M) are dimensionless, order one functions and the last term is a placeholder 
for terms involving more than two derivatives, with at least some derivatives acting on the 
metric and scaling with Mp ^ M. We will examine the contribution of the two new 
six-derivative terms to the correlation functions in the next section. Note that the sound 
speed defined in Eq.( ^.lD is defined from the background solution and so does not receive 
a significant correction from the higher derivative terms. 

Before we continue, we comment that one can imagine being more rigorous about 
eliminating higher derivative contributions. For a classical point particle Lagrangian, there 
is a systematic procedure for obtaining a Hamiltonian H{(f)) by replacing higher derivatives 
by functions of the field and its first derivative as long as the higher derivatives can be 
considered perturbations [47, H^. This ensures that the only solutions to the equations 



of motion are consistent with perturbative corrections to the free field solutions, and only 
two initial conditions are needed. This idea is built in to the field theory structure that 
requires the interaction part of the Hamiltonian to be in some sense small, and in the next 
section we will find the answer suggested by the point particle example - as long as higher 
derivative terms enter as perturbations there is a regime where their contribution, both to 
the background and to correlation functions of fluctuations, is small compared to single- 
derivative terms, even if one has reason to keep an entire power series of single derivative 
terms. 
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4. Estimating the higher derivative contributions 



Truncating the action P{X, (p) assumes that X/M'^ <^ 1. Any truncation obviously leads to 
a truncation in correlation functions. Including terms up to (X/M^)" (and ignoring terms 
from the potential, which are less important) generates correlations up to We saw 

above that < Cg was required for the cumulants to be ordered in the case that only first 
derivatives are considered. Expanding a more general action up to p derivatives allows one 
to calculate up to ((50)^). We can estimate the relative size of the various contributions to, 
say the three point function. Replacing (f) = (f)() + 6(f> in the action Eg. ( [3. 21) and collecting 
terms with three powers of the fluctuation, we have 

53(<^) = f d4xy=^|4/i(<A/M)^[(<5-<A)3 -a-2(5,^)(V<5</>)2] (4.1) 



-2/2(0/M)^ 



+^2^^Mm?-a-\V5ct^f] 

+shm/M)^^^^[mf - 2,a-\mv64>f] + . . . | 

Using d/dt ~ H and d/dx ~ aH/cg-, we immediately see that the contributions to the 
three point function with the strongest Cg dependence go like 



Hf^ (X j dt a^{5(t)f 



M4 c2 



4A + 2A^+24/3|j 



(4.2) 



In the limit of very small sound speed, X/M'^ — )• 1 and so the contributions from all powers 
of X/M^ in the action will be equally important for the correlation functions. These 



contributions can be collected by writing the action as a Taylor expansion, as in [32| or 
implicitly in |]30t| . We also see that while H/M < Cs, contributions from higher derivative 
terms are suppressed relative to single derivative terms. This assuming no parametric 
dependence on H/M, etc in the coefficient, which in any case is likely to further suppress 
the term. 

When is H/M < Cs"! We have assumed that X/M^ < 1. If the previously derived 
condition < cf also holds, we see that 

X eH'^Ml ^ ^ 

For DBI, P^xCs = 1; and we have exactly H/M < Cs- The expansion to six derivative 
terms as written above gives P^xCs < !> and it has been argued that in more general cases 
one can set P^xCs = 1 as a gauge choice (observations are not sensitive to P^x alone.) 
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1 42]. In the perturbative case then, correlation functions of a general scalar field action 
are dominated by contributions from the P{X, (/>) part, assuming the coefficients of powers 
of X/M'^ are the renormalized coefficients, and those correlations are hierarchical. When 
the perturbative condition breaks down, not only are the dimensionless cumulants from 
single derivative terms no longer a converging series, but all higher derivative corrections 
are equally important. Continuing to more derivatives in the action can only bring in more 
copies of X/M"^ or H/Mcs, so, up to gravity terms, we are done. We see that X/M^ < 1 
and < Cg imply H/M < Cg, and as both the first two inequalities are both saturated, so 
is the third. 

5. Example: the DBI action 

5.1 The lowest order brane action in an AdS background 

There is a very interesting scenario with ^ 1 that is a useful test case for effective 
field theory ideas (and where one might hope to study the non-pertrubative regime of very 
small Cg). The Dirac-Born-Infeld (DBI) action contains an infinite series of powers of XjM'^ 
summed to a square-root that enforces X/M'^ < 1. In the case of a D3 brane moving down 



a warped throat M is the warped string scale, where the warp factor depends 

on the position of the brane in the background AdS geometry. The position-dependent 
warp factor translates into a scale M that decreases during inflation and a scale-dependent 
non-Gaussian signature in the resulting density perturbations (which may be observable 
fSlll ). The radial brane position is r, with Vmin < r < R where R (large in string units) 
is the scale of the throat and r^m is the parameter giving the minimum scale where the 
throat smoothly ends. Then (a very simplified version of) the lowest order brane action is 



S = - (fx a 



T3h{r)^/1 - Ph-^{r) - Tshir) + 1/(0)1 (5.1) 

Here T3 = [{2iT)^gsa''^]~^ is the three-brane tension in terms of the string coupling gs < 1 
and the string length a' (x l'^ = mj'^. We have assumed the 10-dimensional metric takes 
the form^. 

dslo = h-^l\y)g^, dx^dx" + h^'\y)gmndy'^dy'' , (5.2) 

where the warp factor h depends only on the coordinates y of the extra dimensions and 
the four-dimensional metric g^y is Friedmann-Robertson- Walker with scale factor a. 

We consider only potential energy dominated cases, so that the Hubble parameter 
during inflation H = d/a is given by 3MpH'^ w ^(0) to a good approximation. For a 
purely AdS background, the warp factor is h{r) = R'^/r^, although the throat may smooth 
out and have a region of constant h{r) = ho near the minimum r^^n ||54|| . The canonical 
inflaton is (p = VT^r, so that the action in terms of (p (and restoring covariance) is 



S = - d^xa' 



/(0)v/l-2A/-i(0) - /(0) + F(0) (5.3) 



^This has not been shown to follow consistently as a supergravity solution, and there are some indications 
that something is missing in this description [ pi] , ^ . However, that is not likely to be relevant for the point 
we are making here. 
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where f{(/)) = T^h "^{(p) = M'^{(p). In this case, the M decreases as (j) does, and the sound 
speed decreases hke Cs oc 4? /M^ [Q. This is plotted below. So, at this level, if the ratios 




Figure 1: For a brane moving in an AdS background geometry, the sound speed continually 
decreases until the geometry changes near rmin- If Cg gets too small (cj^ > V^^^^ ~ 10^/^), the 
perturbative calculation of correlation functions based on the lowest order brane action is no longer 
valid. 



H/Mp (which sets the magnitude of 7-"^) and rmin/R (which determines the minimum Cg) 
are taken to be independent (i.e. controlled by separate microphysical parameters) it looks 
like one may run into a regime with < Vc, in which case the effective theory based 



on the lowest order action breaks down |32|. Of course, we might hope that the stringy 



picture indicates what physics is missing, and it may be that inflation continues even past 
this point. We will use the discussion of higher derivative terms to pursue this idea a little 
further. 

5.2 Higher order corrections 

Eg. ([5. 3D is the lowest order action for the brane. Corrections to the fluctuation spec- 



trum that depend on higher derivatives of the position can be organized as the extrinsic 
curvature, K^y, and its derivatives. In fact, this is a useful organization in the general 
theory of fluctuations [^0| . The extrinsic curvature of the brane due to fluctuations on the 
homogeneous background (9j(/>o = 0) is 



where we have dropped small corrections due to the variation in the warp factor with (j). We 
can verify that in the perturbative regime the curvature is smaller than the local warped 
string scale since 



H V H 



as long as H/M < Cg, < 1. This is good, since if the radius of curvature was less than 
M~^) the brane would be unstable to decay by self-annihilation in the fluctuating regions. 
Eq.( |5.5D also demonstrates that the gradient energy is not yet so large that inflation ends, 
since < V{(t)). However, it is an interesting dynamical question whether or not caustics 
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may develop on a time scale much shorter than the Hubble time |55|. To address that 
question accurately, we would need to know the form of higher derivative terms in the DBI 
action (although we note that self-annihilation of the brane due to folding cannot be whole 
answer, if it happens at all, since D3 charge must be globally conserved). 

We can speculate that the DBI action in a background that gives rise to inflation is 
corrected by higher derivative terms as 

SDBI,gness = " / f {ct>) VT^^2X/W X [l + ^'""'^^ (X/M^) (9^9/0) + . . 

(5.6) 

If the first term in J^{X/M^) is proportional to X/M^ and contains the index contraction we 
used in Eg. ( p. 21 ), this guess resembles what would be obtained by T-duality from the known 
corrections in derivatives of the field strength in the bosonic action (the first correction to 



the superstring action is at eight derivatives) [56|. The multiplication by the square root 
forces the entire derivative structure to vanish as — )• 0. In addition, if all higher derivative 
terms enter multiplied by powers of X (as suggested by the form of field strength derivative 
terms), the higher derivative structure vanishes when the velocity is zero. 

The action suggested by brane dynamics naturally has a more restricted structure 
than the EFT structure considered in |^0|. There the most general effective action for 
the fiuctuations of a single scalar field is organized into contributions to the n-point func- 
tion from a summation of operators with powers of X/M^, and, separately, the extrinsic 
curvature (higher derivative) terms. The extrinsic curvature terms were assumed to be 
independent of the terms proportional to powers of X. Here the symmetries that control 
the square-root summation (conformal invariance, supersymmetry) also control the struc- 
ture of higher derivative terms. More generally, if we believe we know how to sum all (or 
enough of) the operators contributing to, say, the three point to understand a measurably 
large non-Gaussianity, we may also know the structure relating those terms to the extrinsic 
curvature terms. 

The action in Eq.( |5.6D is still not complete, and in particular does not contain terms 
with curvature corrections or couplings to the background fields generating the brane po- 
tential. To answer the question of what happens for brane positions so far in the IR end 
that Cg < P(^, we must consistently include these other corrections. Since this is specific to 
the string theory scenario, we will not pursue it further here. 



6. Conclusions 

Future measurements of non-Gaussianity (even with a null result) can provide unique 
information about physics at the highest energy scale accessible, but the utility of that 
information depends on our ability to translate fundamental models to observation. A 
feature of that translation, beyond the shape and amplitude of the three-point function, 
is the expected structure of correlation functions. Here we have demonstrated explicitly a 
very natural result: that the most general, perturbative, single-field inflation models that 
generate large non-Gaussianity from higher derivative terms have a hierarchical structure. 
That is, as long as the sound speed is large enough that perturbative calculations can 
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be trusted, the statistics can be captured by expanding in moments around a Gaussian 
distribution. The perturbative condition (which assumes X/M^ < 1) takes two useful 
forms: 

4 > P( -or- ^<cs (6.1) 

Derivative interactions are relevant (and non-Gaussianity significant) if the scale of the 
effective theory is not too far above the Hubble scale. While it may seem fine-tuned to 
consider this coincidence of scales (rather than M ~ Mp), it is sometimes a trade-off from 
fine-tuning the flatness of the inflaton potential (which was the context in which this effect 
was originally introduced |49]). These results assume order one coefficients for all terms in 
the action, so fine tuned single field or multi-field examples can be exceptions. A multi-field 
model that is not hierarchical is given in [44|, and the fine-tuning for single field examples 
is discussed in [^]. 

A significantly large non-Gaussianity is easiest to explain if we know something about 
the UV structure of the theory. One of the few known examples is based on the DBI action 
for a D-brane, where higher derivative corrections may also be calculated. In that case, it is 
likely that the same symmetries that lead to the square-root summation of first derivative 
terms also control the at least some of the structure of higher derivatives. 

Investigating the behavior of a general scalar field shows that actions of the A;-inflation 
type (actions of the P{X, (p) type) are not sufficiently general beyond four derivative terms, 
although the difference in the perturbative regime is very small for the background solution 
and down by a factors of H/Mcs in the correlation functions. Interestingly, a simple 
extension of P{X, (p) actions to multiple fields is also modified at four derivatives and in 
that case the deviation appears to be more observationally important |58]. 
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A. Calculating correlation functions 

In this section we calculate the 6(j) correlations and use the first order gauge transformation 
to find the leading contribution to correlations of C (the quantity that is constant outside 
the horizon). This is by no means a complete calculation of C correlations, but it does 
correctly capture the behavior of the leading term in the small Cg limit. This can be 



checked against the calculation of |5£], which works out the complete trispectrum for a 
general sound speed model. 

A.l Statistics of the action expanded to four-derivative terms 

Weinberg [31| has argued that the general form of the effective action up to terms with 
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four derivatives, and for the moment ignoring gravity, is 

X2 



x-y(<A) + 4/i(0)^ + ... 



(A.l) 



where /i(</>) is a dimensionless function of 4> and X = —^g'^'^d^cpducj). Expanding in terms 
of the fluctuation -k = 54)/(t)o and labehng the background piece Cq, this is 



£ = £o + a^M;H{-iT^ + a-\V^Y) (A.2) 



From this, we can compute the tree level three and four point functions of tt. Quantizing 
the fluctuations 

j^Hk, i^)a{k)e^^-- + n*(A:, r?)at(A;)e-'=-] (A.3) 
where [a[k),a) (k')] = (27r)'^J^(A; — k') and from the quadratic terms 

^(^'^) = , (l + iA:c,r?)e-^^--'^ (A.4) 

^J2csP,xk'^ 

Then from Eq.( |2.23| ), and taking 77 — )• 0, 



,5\ki + k2 + k^ + ki) ( h\ ( 

cik2k3k4)K^ \8My y^^csP;^^ 

where K = ^^ki, and B{ki,k2,k-i) and Q{ki,k2,k3,k/C) contain terms of order 1 as well 
as terms suppressed by powers of c^. At this order, we may use ( = —Hit. We also use the 
relations Vt = — „ and 

^.^ (4 " ^) = (A.6) 
I6/1 



M4 

Then for a general action P{X, (p) we can write (absorbing numerical factors into the 
functions of momenta) 

where terms in B and Q ^-^e 0(Cg, c^, cf . . . ) and so the term with the most powers of Cg 
in the denominator scales as we expected: oc T'^'VlCs)""^- 
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A. 2 Statistics of the action expanded to six-derivative terms 

Expanding the action up to six derivatives introduces a 5-point and 6-point function and 
gives tree-level corrections to the three and four-point correlations calculated above. The 
solutions for the Hubble parameter H{(j)) is very nearly unchanged since the kinetic terms 
do not dominate the energy density, but the relationship between the sound speed and 
the function /i does change. For terms other than the leading one, we can no longer use 
just the first order expression relating 5(j) and (" or the approximation vr = 5(f) /(j)Q. For 
that reason, we will only write the contribution from the dominant term. A more precise 
calculation of the four-point correlation for P{X, (j)) models, which also covers the case 



where derivative self-interactions do not dominate, can be found in [59|. 

To find the additional contribution from A'^/M^, one can replace the functions fi by 
combinations of the derivatives P^x, P,xx etc. appropriate for each order of the expansion. 

The dominant contribution from the new term containing second derivatives is 



C(A:i)C(A:2)C(A:3; 



3) 



B{ki,k2,h) 



{kik2k^)K^ 



B{ki,k2,k 




/i + 3/3(A/M4) 



where we have used the first line of Eq.(A.7) and 

8/i , 48A/3 



'XX 



M4 MS 

For fi{(j)/M) and fz{4>/M) order 1, this is the coefficient we anticipated in Eq.(|4.2|). 



(A.9) 
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